Cross-lingual Incongruences in the Annotation of Coreference
In the present paper, we deal with incongruences in English-German multilingual coreference annotation and present automated methods to discover them. More specifically, we automatically detect full coreference chains in parallel texts and analyse discrepancies in their annotations. In doing so, we wish to find out whether the discrepancies derive from language-typological constraints, from the translation, or from the actual annotation process. The results of our study contribute to the referential analysis of similarities and differences across languages and support the evaluation of cross-lingual coreference annotation. They are also useful for cross-lingual coreference resolution systems and contrastive linguistic studies.
Analysing concatenation approaches to document-level NMT in two different domains
In this paper, we investigate how different aspects of discourse context affect the performance of recent neural MT systems. We describe two popular datasets covering news and movie subtitles and provide a thorough analysis of the distribution of various document-level features in their domains. Furthermore, we train a set of context-aware MT models on both datasets and propose a comparative evaluation scheme that contrasts coherent context with artificially scrambled documents and absent context, arguing that the impact of discourse-aware MT models will become visible in this way. Our results show that the models are indeed affected by the manipulation of the test data, providing a different view on document-level translation quality than absolute sentence-level scores.
Annotating tense, mood and voice for English, French and German
We present the first open-source tool for annotating morphosyntactic tense, mood and voice for English, French and German verbal complexes. The annotation is based on a set of language-specific rules, which are applied on dependency trees and leverage information about lemmas, morphological properties and POS-tags of the verbs. Our tool has an average accuracy of about 76%. The tense, mood and voice features are useful both as features in computational modeling and for corpus-linguistic research.
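To make the rule-based approach concrete, the following is a minimal, hypothetical sketch of how such language-specific rules might assign tense and voice to an English verbal complex from its lemmas and POS-tags. The function name and the rules themselves are illustrative assumptions, not the tool's actual rule set, which covers far more constructions and three languages.

```python
# Hypothetical sketch of rule-based tense/voice annotation for an English
# verbal complex. Input: list of (lemma, pos_tag) pairs, one per verb in
# the complex, in surface order (Penn Treebank-style tags assumed).

def annotate_english(verbs):
    """Assign a (tense, voice) pair to a simple English verbal complex."""
    lemmas = [lemma for lemma, _ in verbs]
    tags = [tag for _, tag in verbs]

    # Passive: an auxiliary "be" followed by a past participle (VBN).
    voice = "passive" if "be" in lemmas[:-1] and tags[-1] == "VBN" else "active"

    # Tense from the finite (first) element of the complex.
    if tags[0] in ("VBD", "VBN"):
        tense = "past"
    elif lemmas[0] == "will":
        tense = "future"
    else:
        tense = "present"
    return tense, voice
```

For example, the complex "was written" (`[("be", "VBD"), ("write", "VBN")]`) would be labelled past passive by these rules; the real tool additionally handles mood and compound tenses such as the perfect and progressive.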
Findings of the 2017 DiscoMT Shared Task on Cross-lingual Pronoun Prediction
We describe the design, the setup, and the evaluation results of the DiscoMT 2017 shared task on cross-lingual pronoun prediction. The task asked participants to predict a target-language pronoun given a source-language pronoun in the context of a sentence. We further provided a lemmatized target-language human-authored translation of the source sentence, and automatic word alignments between the source sentence words and the target-language lemmata. The aim of the task was to predict, for each target-language pronoun placeholder, the word that should replace it from a small, closed set of classes, using any type of information that can be extracted from the entire document. We offered four subtasks, each for a different language pair and translation direction: English-to-French, English-to-German, German-to-English, and Spanish-to-English. Five teams participated in the shared task, making submissions for all language pairs. The evaluation results show that all participating teams outperformed two strong baseline systems based on n-gram language models by a sizable margin.
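The task setup described above can be sketched as a closed-class classification problem. The following is a minimal, hypothetical illustration for the English-to-French direction: the class inventory, the placeholder convention, and the lexical-prior "model" are all simplifying assumptions standing in for the task's actual data format and the participants' learned systems.

```python
# Hypothetical sketch of cross-lingual pronoun prediction as closed-class
# classification (English-to-French). A real system would use the full
# document context, the lemmatized translation, and the word alignments;
# here a trivial source-pronoun lookup stands in for a learned model.

# Illustrative closed set of target-language classes, plus a catch-all.
CLASSES = ["ce", "cela", "elle", "elles", "il", "ils", "on", "OTHER"]

# A naive lexical prior mapping source pronouns to their most frequent
# French class; an assumed stand-in, not a baseline from the task.
LEXICAL_PRIOR = {"it": "il", "he": "il", "she": "elle", "they": "ils"}

def predict(source_pronoun):
    """Predict the class that should fill one target-side placeholder."""
    return LEXICAL_PRIOR.get(source_pronoun.lower(), "OTHER")
```

Even this context-free lookup makes the evaluation setting clear: each placeholder receives exactly one label from the closed set, so systems can be scored by per-class and overall accuracy against the human-authored reference.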